Picture for Yang Zhou

Yang Zhou

Yahoo! Labs

Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals

Add code
Feb 03, 2026
Viaarxiv icon

Unrewarded Exploration in Large Language Models Reveals Latent Learning from Psychology

Add code
Jan 30, 2026
Viaarxiv icon

Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes

Add code
Jan 29, 2026
Viaarxiv icon

Aligning Medical Conversational AI through Online Reinforcement Learning with Information-Theoretic Rewards

Add code
Jan 25, 2026
Viaarxiv icon

PUNCH: Physics-informed Uncertainty-aware Network for Coronary Hemodynamics

Add code
Jan 23, 2026
Viaarxiv icon

RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation

Add code
Jan 13, 2026
Viaarxiv icon

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Add code
Jan 05, 2026
Viaarxiv icon

CTIS-QA: Clinical Template-Informed Slide-level Question Answering for Pathology

Add code
Jan 05, 2026
Viaarxiv icon

DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving

Add code
Jan 04, 2026
Viaarxiv icon

Explainability-Guided Defense: Attribution-Aware Model Refinement Against Adversarial Data Attacks

Add code
Jan 02, 2026
Viaarxiv icon